Although data centers and computing centers are playing an increasingly important role, and
on-line research is becoming the mainstream in astronomy, individual research based on locally hosted
data is still very common. With the growth of personal storage capacity, it is easy to find hundreds to
thousands of FITS files on an astrophysicist’s personal computer. Because the Flexible Image Transport
System (FITS) is a professional data format initiated by astronomers and used mainly within this small
community, very few data management toolkits exist for FITS files. Astronomers need a powerful tool to help
them manage their local astronomical data. Although the Virtual Observatory (VO) is a network-oriented
astronomical research environment, its applications and related technologies provide useful solutions
for improving the management and utilization of astronomical data hosted on an astronomer’s personal
computer. FITSManager is such a tool: it gives astronomers efficient management and utilization
of their local data, bringing the VO to them in a seamless and transparent way. FITSManager provides
a rich set of functions for FITS file management, such as thumbnails, previews, type-dependent icons,
header keyword indexing and search, and collaboration with other tools and on-line services. The
development of FITSManager is an effort to fill the gap between the management and the analysis of
astronomical data.


© 2011 Elsevier B.V. All rights reserved. 


1. Introduction 


Astronomy depends upon observations, which are recorded 
with naked eyes, films or modern digital detectors. Scientific data, 
including observational data and other kinds of data, for example 
simulation results, are basic resources for astronomy research. 
During the last few decades, with the development of detectors 
and information technologies, astronomers’ data harvesting abili- 
ties have increased sharply. Astronomy is facing a data deluge, 
and traditional data processing and analysis are facing big chal- 
lenges (Szalay and Gray, 2001; Bell et al., 2009; Hey et al., 2009). 
As one of the solutions to these challenges, the Virtual Observatory
(VO) was initiated around 2000, and an international working
environment, the International Virtual Observatory Alliance
(IVOA), was established in 2002.

A Virtual Observatory is a data-intensive on-line astronomical
research and education environment, taking advantage of advanced
information technologies to achieve seamless, global access
to astronomical information (Cui and Zhao, 2008). The power of
the World Wide Web is its transparency. It is as if all the documents
in the world are inside your PC. The idea of the Virtual
Observatory is to achieve the same transparency for astronomical
data and other related information (Quinn et al., 2004).

Scientific data available to the whole community are increasing
sharply. At the same time, thanks to fast networks and
cheap storage devices, personal collections are also growing rapidly.
For an astronomer working in observational astronomy, it is
very common to have hundreds or thousands of FITS files on a personal
laptop. These astronomers often find it very
hard to locate a specific file among a large number of directories
and files, and, what is worse, they can hardly find any tool
to help them.

During the last decades, several popular data processing and analysis
packages, e.g. IRAF,2 ESO-MIDAS,3 SAOImage DS9,4 Aladin,5 and
fv,6 have been developed to meet astronomers’ requirements,
but none of them provides powerful data management functions.


2 <http://iraf.noao.edu/>
3 <http://www.eso.org/sci/software/esomidas/>
4 <http://hea-www.harvard.edu/RD/ds9/>
5 <http://aladin.u-strasbg.fr/>
6 <http://heasarc.nasa.gov/ftools/fv/>
Fig. 1. Main interface of FITSManager.


FITS is the most common data format for astronomers (Wells et al.,
1981; Hanisch et al., 2001), but it is used only in astronomy and a
few closely related fields. Its user base is a very small community.
For general image formats, e.g. JPG, GIF and TIF, there are quite a few
image management applications, both freeware and commercial, for
example ACDSee,7 Google Picasa,8 and so on. Unfortunately, none
of these supports files in FITS format very well.

In professional data centers, database management systems
(DBMS) and data grid technologies are used to manage large numbers
of data files and their metadata. Finding and using such DBMS
and professional data management technologies is difficult
for most astronomers. Thus, there is a gap between data
analysis and data management for personal users. FITS Manager9
(FITSManager) is a tool that aims to provide powerful and
easy-to-use management functions for individual astronomers, and
at the same time to link these functions with data analysis tools
and services, including not only VO-ready ones but also legacy
applications. Relative to professional data analysis packages, e.g. IRAF,
MIDAS and the IDL Astronomy User’s Library,10 the role of
FITSManager is very similar to that of Google Picasa relative to Adobe
Photoshop.11

The mission of FITSManager is to provide individual astronomers
with efficient management and utilization of large collections
of locally stored FITS files, bringing the VO to them in a seamless and
transparent way.12 In the following sections of the paper, we first
describe the functions and key features of FITSManager, and then
introduce the architecture and technical highlights of the software.


7 <http://www.acdsee.com/>
8 <http://picasa.google.com/>
9 <http://fm.china-vo.org/>
10 <http://idlastro.gsfc.nasa.gov/>
11 <http://www.adobe.com/products/photoshop/>
12 Virtual Astronomical Observatory Project Execution Plan. November 2010,
version 1.1. p21. <http://www.usvao.org/documents/Virtual_Astronomical_Observatory_Project_Execution_Plan_v1.1.pdf>.


In the last part, we discuss future trends in astronomical data management
and utilization, and summarize the development and future
plans for the application.


2. Key features and functions 


Professional data analysis packages and toolkits provide powerful
data processing, analysis and visualization functions for FITS
files, but at best very limited management functions.
Users have to manage and locate their FITS files using directories
and filenames. When the number of FITS files is small, this approach
is reasonable and acceptable. When an astronomer has to manage
thousands to tens of thousands of FITS files on a computer, finding
a specific file becomes hard and time-consuming.

FITSManager focuses on the management of small to moderately
large numbers of FITS files hosted on an astronomer’s personal computer,
providing index, search, preview and external linking functions. On a
recent laptop with 2 GB or more of memory, it is feasible
for FITSManager to handle thousands of FITS files. When the collection
reaches tens of thousands of files or more, data center level
solutions are recommended. The main interface is shown in Fig. 1.
Key features and functions of the current release include:


• File search based on FITS header keywords and other metadata.
  Two modes are provided in the current version: general search
  and advanced search. For general search, a Google-like form is
  provided, as shown in Fig. 2. FITS header content and filenames
  in the current file list are matched against the search phrase.
  Advanced search is database based. Before using this
  function, header keyword information of the FITS files under a given
  directory or a whole file system has to be imported into a database
  system. Graphical User Interface (GUI) wizards are provided
  for both FITS header archiving and FITS file search. The file
  search interface and back-end programs will be improved in
  future versions, guided by user feedback.
Fig. 2. File Search Dialog.

Fig. 3. Relationships among core plug-in, application plug-ins and OSGi framework.


• Grouping, annotating and rating. The FITS files hosted on the
  user’s computer can be grouped into different categories by
  their content, type or other characteristics. Annotations and 5-star
  ratings can be added to selected FITS files, and these are searchable
  later. These functions, similar to those provided by many
  image management applications, are useful for file management.

• Thumbnail and preview. The FITSManager explorer provides several
  views for files and directories, including thumbnails, tiles,
  icons, list and detail. In the tiles and icons modes, FITSManager
  provides different icons to distinguish different FITS file types,
  e.g. image, random groups, multiple HDU (Header and Data
  Unit) FITS. In thumbnails mode, the software creates and displays
  thumbnails for several typical FITS file types. In the current
  release, table, 2D image and 1D spectrum types are
  supported. In future releases, thumbnails for more
  types of FITS files will be provided. This function is very helpful
  for rapid inspection and location of FITS files containing image
  and spectral data.

• Interoperability with VO tools. The IVOA VOTable13 data format and
  SAMP14 messaging are supported by FITSManager. IVOA
  members have recognized that building a monolithic tool that
  attempts to fulfill all the requirements of all users is impractical,
  and that it is a better use of limited resources to enable individual
  tools to work together efficiently. SAMP is an IVOA messaging
  protocol that enables astronomy software tools to interoperate

13 <http://ivoa.net/Documents/VOTable/> 
14 <http://ivoa.net/Documents/latest/SAMP.html> 


Fig. 4. Logical architecture of FITSManager. 


  and communicate. As mentioned above, FITSManager focuses
  on file management, and leaves other functions, e.g. processing,
  visualization, analysis and statistics, to other existing and
  newly developed tools. Selected files in FITSManager can be
  sent to SAMP-ready applications, for example Aladin, Topcat,15
  SAOImage DS9, Worldwide Telescope16 (WWT), and so forth.

• Easy-to-use graphical user interface. FITSManager is an end-user
  application, so a friendly user interface is very important. The
  FITSManager explorer looks like the MS Windows File Explorer. An
  astronomer can browse the directory and file tree in the
  same manner as with the file manager provided by the operating
  system (OS). When a FITS file is selected, FITS-specific functions
  become available. For files in other formats, the navigation
  process is similar to using an OS file manager.
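
The general search described in the first item above can be illustrated with a minimal Python sketch. It is not FITSManager code (the application itself is written in Java); the function names `parse_header`, `general_search` and `make_card` are hypothetical, and the two in-memory headers are invented test data. The sketch only assumes the FITS convention that a header is a sequence of 80-character cards terminated by an END card.

```python
CARD = 80  # every FITS header card is 80 ASCII characters long


def parse_header(raw: bytes) -> dict:
    """Parse FITS header cards into a {keyword: value} dict (simplified)."""
    header = {}
    for start in range(0, len(raw), CARD):
        line = raw[start:start + CARD].decode("ascii", errors="replace")
        keyword = line[:8].strip()
        if keyword == "END":
            break
        if line[8:10] != "= ":
            continue  # COMMENT, HISTORY and blank cards carry no value
        value = line[10:].strip()
        if value.startswith("'"):
            # Quoted string: keep the text between the quotes
            # (doubled quotes inside strings are not handled here).
            value = value[1:].split("'", 1)[0].rstrip()
        else:
            # Numeric or logical value: drop the trailing comment.
            value = value.split("/", 1)[0].strip()
        header[keyword] = value
    return header


def general_search(phrase: str, files: dict) -> list:
    """Return names of files whose name or header text contains the phrase."""
    needle = phrase.lower()
    hits = []
    for name, header in files.items():
        text = " ".join(f"{k} {v}" for k, v in header.items())
        if needle in name.lower() or needle in text.lower():
            hits.append(name)
    return hits


def make_card(text: str) -> bytes:
    """Pad one header line to the fixed 80-character card length."""
    return text.ljust(CARD).encode("ascii")


# Two tiny in-memory headers stand in for files in the current file list.
raw_a = (make_card("SIMPLE  =                    T")
         + make_card("OBJECT  = 'M31     '") + make_card("END"))
raw_b = (make_card("SIMPLE  =                    T")
         + make_card("OBJECT  = 'NGC 1068'") + make_card("END"))
files = {"m31_r.fits": parse_header(raw_a),
         "agn_field.fits": parse_header(raw_b)}

print(general_search("ngc 1068", files))  # matched through header content
print(general_search("m31", files))       # matched through filename and header
```

A case-insensitive substring match is only one plausible reading of "Google-like"; the actual matching rules of the application are not specified in the text.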


3. Architecture and technical highlights 

During the design and development of FITSManager, advanced
development models were adopted and the latest VO functions
were considered.


3.1. Plug-in based architecture

Plug-in development is a very popular approach to software
development nowadays. Many well-known packages, for example


15 <http://www.star.bris.ac.uk/~mbt/topcat/>
16 <http://www.worldwidetelescope.org/> 
Fig. 5. FITS file grouping and rating window.


Firefox,17 Eclipse18 and many content management systems (CMS),
are developed based on plug-ins.

Eclipse with Equinox19 is used as the development environment.
Open Service Gateway Initiative (OSGi) technology is the dynamic
module system for Java, which enables the modular assembly of
software built with Java technology. From a code point of view, Equinox
is an implementation of the OSGi20 framework specification, a
set of bundles that implement various optional OSGi services and
other infrastructure for running OSGi-based systems.
In addition, Eclipse provides a powerful plug-in development
environment for the OSGi framework, which makes plug-in coding
much easier.

In the FITSManager, all of the specific functions are realized as 
dynamic modules, i.e. plug-ins. A core plug-in provides common 
interfaces for other application plug-ins, and application plug-ins 
provide the dedicated functions mentioned in the above section. 

In the OSGi framework, plug-in discovery, loading and execution are
all handled by the framework. From the framework’s point of view, all
the attached plug-ins have equal roles. In the FITSManager
package, however, one plug-in provides the basic functions. We call
this plug-in the core plug-in, and the other plug-ins application
plug-ins. Although the FITSManager application plug-ins are physically
loaded by the OSGi framework, their functions depend on the core
plug-in. Relationships among the core plug-in, the application plug-ins
and the OSGi framework are shown in Fig. 3.

When the basic control functions of application plug-ins, for example
start/stop, are integrated into the core plug-in, the back-end
OSGi framework can be ignored at the logical level. The architecture of
FITSManager is then simplified as in Fig. 4, consisting of a core and a
bundle of application plug-ins.


17 <http://www.mozilla.com/en-US/firefox/> 
The FITSManager core plug-in is the kernel of the application. It
controls the start and stop of application plug-ins, and provides extension
interfaces to them. The main functions of the core plug-in include
software configuration, I/O interface management, database
connection, different views of the file system, etc.
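
The logical relationship between the core and the application plug-ins (Fig. 4) can be sketched as follows. This is an illustrative Python model only: the real application is built on Java and OSGi/Equinox, and the classes `Core` and `Plugin` and their methods are hypothetical names, not part of OSGi or of FITSManager.

```python
class Plugin:
    """Hypothetical base class for an application plug-in."""

    name = "plugin"

    def __init__(self):
        self.running = False

    def start(self, core):
        # A real plug-in would obtain extension interfaces from the core here.
        self.running = True

    def stop(self):
        self.running = False


class Core:
    """Hypothetical core: registers application plug-ins and controls them."""

    def __init__(self):
        self._plugins = {}

    def register(self, plugin):
        self._plugins[plugin.name] = plugin

    def start(self, name):
        self._plugins[name].start(self)

    def stop(self, name):
        self._plugins[name].stop()

    def running(self):
        """Names of the plug-ins that are currently started, sorted."""
        return sorted(n for n, p in self._plugins.items() if p.running)


class Explorer(Plugin):
    name = "Explorer"


class Favorites(Plugin):
    name = "Favorites"


core = Core()
core.register(Explorer())
core.register(Favorites())
core.start("Explorer")
core.start("Favorites")
core.stop("Favorites")
print(core.running())
```

In this simplified view the framework that physically loads the modules is hidden, and only the start/stop control exercised by the core remains visible.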

Several basic application plug-ins have been provided: 


• Explorer, a file manager similar to those provided by many OS
  distributions, for instance MS Windows and various Linux systems
  (Fig. 1).

• Category, which enables the user to create custom groups for FITS
  files, and to annotate and rate these files (Fig. 5).

• Favorites, an index view of selected file system directories
  (Fig. 6). Header information of the FITS files under these directories
  is imported into a database and indexed, which enables fast FITS
  file search on FITS header keywords.

• IconCreater, which returns different sets of icons for the current
  file system content to the FITSManager core plug-in according to its
  request (Fig. 7). Five types of views are supported: thumbnails, icons,
  tiles, list and detail. It is worth mentioning that
  the IconCreater plug-in reads and analyzes the header information
  of FITS files and creates different icons for them according
  to their FITS types, e.g. image, multiple HDU, etc.
  Additionally, the plug-in also acts as a FITS viewer (Fig. 8).


When the core plug-in requests a thumbnail view, the IconCreater
loads the first data HDU of a FITS file into memory and returns a
thumbnail for the file. If the values of the NAXIS1 and NAXIS2 keywords in
the FITS header differ by orders of magnitude, for
example several vs thousands, the IconCreater plug-in assumes
that the file contains a spectrum, and plots the first line as its
thumbnail. If the FITS file is an image, a thumbnail is returned after
essential processing of the first data HDU, for example cutout, sharpening
and contrast adjustment. For table FITS files, a blank table frame
with rows and columns defined by the NAXIS1 and NAXIS2 keywords
is returned as the thumbnail.
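
The decision rule described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the IconCreater implementation: the function name `thumbnail_kind` is hypothetical, the axis-ratio threshold of 100 is an assumed stand-in for "not in the same magnitude level", and detecting tables through the XTENSION keyword is likewise an assumption.

```python
def thumbnail_kind(header: dict, ratio: float = 100.0) -> str:
    """Guess which thumbnail to draw from keywords of the first data HDU.

    `ratio` is an assumed threshold: axes whose lengths differ by at
    least this factor are treated as a spectrum rather than an image.
    """
    if str(header.get("XTENSION", "")).strip() in ("TABLE", "BINTABLE"):
        return "table"  # blank frame sized by NAXIS1 and NAXIS2
    naxis = int(header.get("NAXIS", 0))
    n1 = int(header.get("NAXIS1", 0))
    n2 = int(header.get("NAXIS2", 0))
    if naxis == 1 and n1 > 0:
        return "spectrum"
    if naxis >= 2 and n1 > 0 and n2 > 0:
        if max(n1, n2) / min(n1, n2) >= ratio:
            return "spectrum"  # e.g. a few rows by thousands of pixels
        return "image"
    return "unknown"


# Invented header values used only to exercise the three branches.
print(thumbnail_kind({"NAXIS": 2, "NAXIS1": 3865, "NAXIS2": 5}))
print(thumbnail_kind({"NAXIS": 2, "NAXIS1": 2048, "NAXIS2": 1489}))
print(thumbnail_kind({"XTENSION": "BINTABLE", "NAXIS": 2,
                      "NAXIS1": 40, "NAXIS2": 640}))
```

With these inputs the three calls fall into the spectrum, image and table branches respectively. A fixed ratio is a crude heuristic and would misclassify very elongated images.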
Fig. 6. Index and search based on FITS header keywords.


Creating thumbnails on the fly for FITS files is a new function
introduced by the application. When an astronomer is looking
for FITS files with distinctive characteristics among a large number of
files, this function is very useful.


• SAMP, which supports message exchange between FITSManager and
  other SAMP-ready tools, for example Aladin, Topcat, VOPlot,
  SAOImage DS9, WWT, etc. (Fig. 9), linking them into an
  astronomical data management and utilization environment.
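
To make the SAMP exchange more concrete, the sketch below builds the kind of message a file manager could broadcast when a file is selected. It is an assumption-laden illustration in Python rather than FITSManager's Java code: the helper `load_message` and the file name `example.fits` are hypothetical, while `image.load.fits` and `table.load.fits` are standard SAMP MTypes that take a `url` parameter and an optional `name`.

```python
from pathlib import Path


def load_message(path: str, kind: str = "image") -> dict:
    """Build a SAMP load message for a local FITS file (not sent anywhere)."""
    mtypes = {"image": "image.load.fits", "table": "table.load.fits"}
    file_path = Path(path).absolute()
    return {
        "samp.mtype": mtypes[kind],
        "samp.params": {
            "url": file_path.as_uri(),  # file:// URL readable by local tools
            "name": file_path.name,
        },
    }


msg = load_message("example.fits")
print(msg["samp.mtype"])
print(msg["samp.params"]["name"])
```

Actually delivering the message requires a running SAMP hub and a client library (for example `astropy.samp` in Python); that step is omitted here so that the sketch runs without any external service.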


3.2. Other technical highlights 


FITSManager is developed in Java, so operating system independence
comes naturally. Although database systems are
very important and popular among developers, they are not so familiar
to astronomers, who are the end users of the package. For this reason,
FITSManager adopts an embedded Java database management system,
Apache Derby.21 An embedded database component requires no
installation or configuration. When astronomers
use the flexible index and search functions, they do not notice
that a database system is running at the back end. If a user
needs to use a standalone database system, FITSManager also provides
an interface to connect to it.
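
The idea of an embedded database that needs no installation can be illustrated with Python's built-in `sqlite3` module, used here purely as a stand-in for Apache Derby. The table layout, the function names `import_header` and `search`, and the sample paths and keyword values are all assumptions made for this sketch; the actual FITSManager schema is not described in the text.

```python
import sqlite3

# An in-memory database: nothing to install or configure for the user.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE header (path TEXT, keyword TEXT, value TEXT)")
conn.execute("CREATE INDEX idx_keyword_value ON header (keyword, value)")


def import_header(path: str, header: dict) -> None:
    """Store every keyword/value pair of one file's header."""
    conn.executemany(
        "INSERT INTO header VALUES (?, ?, ?)",
        [(path, key, str(value)) for key, value in header.items()],
    )


def search(keyword: str, value: str) -> list:
    """Return paths of files whose header has keyword = value."""
    rows = conn.execute(
        "SELECT DISTINCT path FROM header "
        "WHERE keyword = ? AND value = ? ORDER BY path",
        (keyword, value),
    )
    return [row[0] for row in rows]


# Invented sample data standing in for an indexed directory.
import_header("a/m31_r.fits", {"OBJECT": "M31", "FILTER": "r"})
import_header("a/m31_g.fits", {"OBJECT": "M31", "FILTER": "g"})
import_header("b/m33_r.fits", {"OBJECT": "M33", "FILTER": "r"})

print(search("OBJECT", "M31"))
print(search("FILTER", "r"))
```

A keyword/value table of this kind keeps the schema independent of which header keywords a given instrument happens to write, at the cost of one row per card.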


3.3. Performance test 


The performance of two key features of the application was tested
using data from one spectroscopic plate of SDSS DR7.22 The test
was conducted on a Lenovo ThinkPad T410i laptop with an Intel(R)
Core(TM) i5 CPU (M430 @ 2.27 GHz), 4 GB RAM, Microsoft Windows
7 Professional 32-bit OS, and a Western Digital WD3200BEVT-08A23T1
ATA hard disk.

In plate 1529-52930 of tile 9435, there are 640 1D spectrum
FITS files in the “spSpec” subdirectory and 640 image FITS files
in the “spAtlas” subdirectory. In “thumbnail” view mode, it takes less
than one second to create thumbnails for the 640 spectrum or image
FITS files in one directory.


21 <http://db.apache.org/derby/> 
22 <http://www.sdss.org/dr7/> 


It takes longer to create an index for the FITS files in a given
directory. For the SDSS spectroscopic plate 1529-52930,
it takes about 10 min to import the FITS header content of the
640 spectrum files into the FITSManager embedded database, i.e.
about 1 s per file. When the FITS header is simpler,
the process is faster. Fortunately, this operation is not executed
very often. Once an index is created, it can be used for a long
time if there is no change in the directory.
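
The per-file figure follows directly from the two measured numbers, as the short calculation below shows. The extrapolation to a larger collection is only a rough estimate that assumes indexing time scales linearly with the number of files and with headers of similar size; the 5000-file collection is a hypothetical example, not a measurement.

```python
files = 640          # spectrum files on the tested plate
import_minutes = 10  # approximate measured import time

seconds_per_file = import_minutes * 60 / files
print(round(seconds_per_file, 2))  # 0.94, i.e. about 1 s per file


def estimated_minutes(n_files: int) -> float:
    """Linear extrapolation of indexing time (an assumption, not a result)."""
    return n_files * seconds_per_file / 60


print(round(estimated_minutes(5000)))  # 78 minutes for a hypothetical collection
```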

The above results were obtained when the laptop was mainly used
by FITSManager. If other applications are running at the
same time, thumbnail creation and indexing will
take longer.


4. Summary 


With the arrival of cloud computing,23 it seems that everything
will appear as a service (XaaS). The borders between local storage and
remote storage, and between local applications and on-line services,
are becoming more and more blurred, which is changing the way we manage
and use data. More and more research activities
will take advantage of on-line services. On the other hand, personal
computers and laptops will keep their role for quite a long
time as the dominant working platform for a scientist, and offline
research should not be ignored. FITSManager is a tool that
brings VO functions and on-line services to today’s astronomers
in a seamless and transparent way, without requiring the user
to take care of the physical locations of these functions and
services.

Future versions of the software will be enhanced in the
following ways.


• Deeper involvement with VO and on-line services. Ground-breaking
  applications and on-line services appear from time
  to time, for example “astrometry.net” (Lang et al., 2010). Taking
  advantage of the OSGi module development environment, plug-ins
  can be developed for these smart services and integrated


23 <http://en.wikipedia.org/wiki/Cloud_computing> 
Fig. 7. Different views of data files, (a) shows “Icons” view and (b) shows “Thumbnails” view.


  into FITSManager seamlessly. Plug-ins for astrometry.net,
  VOSpace24 and Montage25 are planned for the next release.


24 <http://www.ivoa.net/Documents/VOSpace/> 
25 <http://montage.ipac.caltech.edu/> 


• Developer and user community. OSGi technology provides
  standardized primitives that allow applications to be constructed
  from small, reusable and collaborative components.
  These components can be composed into an application and
  deployed. Furthermore, the dynamic module development model
  enables loose collaboration, i.e. a developer community.
Fig. 8. View FITS file content and its thumbnail. (a) Shows the content of a selected FITS file in (b).


  A FITSManager developer and user community is planned. Third-party
  developers and end users can develop their own plug-ins
  and contribute them to the whole community, which will rapidly enrich
  the functions of the application.
Fig. 9. Data exchange between FITSManager and SAOImage DS9 and Aladin. (a) Select a file and send it to other applications through SAMP. (c) File received by Aladin.


• Enhanced thumbnail and preview functions. On the one hand, more
  types of FITS files will be supported, for example multi-fiber
  spectra, data cubes, etc. On the other hand, optimized thumbnails
  and previews will be provided where possible, which requires
  consideration of many aspects of the data files and intelligent analysis
  of their content.

• Easy-to-use user interface. During the past few decades, many
  professional astronomers have become used to the command line
  interfaces (CLI) that come with astronomical data processing platforms,
  e.g. IRAF and ESO-MIDAS, and with Unix-like operating systems.
  Nowadays, networking is stepping into the service-oriented and cloud
  computing era. Advanced graphical user interfaces, for example
  the Multi-Touch screen of the Apple iPad,26 have acquired dominant
  positions as user interfaces, and have been widely accepted
  by the younger generation. MSR Worldwide Telescope
  is a successful example of user interface design in astronomical
  software. For FITSManager, as one of many astronomical
  tools, a user-friendly and easy-to-use GUI is very important. In
  future versions, the menu layout and window theme will be
  redesigned. The file search dialog will be greatly improved,
  with a much friendlier appearance.

• Comprehensive documentation and help system. Documentation
  and a help system are very important for users, especially
  for a new application. A FITSManager user manual is in preparation,
  and a searchable “Help” button will be implemented in
  the tool.


Acknowledgement 


FITSManager is a collaborative project between the Chinese Virtual
Observatory (China-VO) and Virtual Observatory India (VO-India).
This work is funded by the National Natural Science Foundation of
China (10820002, 60920010, 90912005), the Ministry of Science and
Technology of China (BSDN2009-07), and the Beijing Municipal Science
and Technology Commission (2007A085). VO-India is a collaboration
between the Inter-University Centre for Astronomy and Astrophysics
and Persistent Systems Ltd., and is partially funded by the
Ministry of Communications and Information Technology of the
Government of India. The first author thanks Robert Hanisch
(STScI), Alex Szalay (JHU) and Mark Allen (CDS) for their helpful
discussions.


References 


Bell, G., Hey, T., Szalay, A., 2009. Science 323 (5919), 1297.
doi:10.1126/science.1170411.

Cui, C.Z., Zhao, Y.H., 2008. Worldwide R&D of Virtual Observatory. In: Jin, W.J.,
Platais, I., Perryman, M.A.C. (Eds.), A Giant Step: from Milli- to Micro-arcsecond
Astrometry, Proceedings of the International Astronomical Union (2007)
Symposium S248, 3, 563-564.

Hanisch, R.J., Farris, A., Greisen, E.W., Pence, W.D., Schlesinger, B.M., Teuben, P.J., 
Thompson, R.W., Warnock, A., 2001. A&A 376, 359. 

Hey, T., Tansley, S., Tolle, K., 2009. The Fourth Paradigm: Data-Intensive Scientific
Discovery, second ed. Microsoft Research, Redmond, WA.

Lang, D., Hogg, D.W., Mierle, K., Blanton, M., Roweis, S., 2010. AJ 139, 1782. 
doi:10.1088/0004-6256/139/5/1782. 

Quinn, P.J., Barnes, D.G., Csabai, I., Cui, C., Genova, F., Hanisch, B., Kembhavi, A., Kim, 
S.C., Lawrence, A., Malkov, O., Ohishi, M., Pasian, F., Schade, D., Voges, W., 2004. 
The International Virtual Observatory Alliance: recent technical developments 
and the road ahead. In: Oschmann, J.M., Glasgow, Jr. (Eds.), Ground-based 
Telescopes, Proceedings of the SPIE, 5493, 137-145. 

Szalay, A., Gray, J., 2001. Science 293 (5537), 2037.
doi:10.1126/science.293.5537.2037.

Wells, D.C., Greisen, E.W., Harten, R.H., 1981. A&A Supplement Series 44, 363. 


26 <http://www.apple.com/ipad/design/> 


